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Abstract 

In this paper we study the class of so-called privileged words which have been previously consid- 
ered only a little. We develop the basic properties of privileged words, which turn out to share 
similar properties with palindromes. Privileged words are studied in relation to previously 
studied classes of words, rich words, Sturmian words and episturmian words. A new characteriza- 
tion of Sturmian words is given in terms of privileged complexity. The privileged complexity of 
the Thue-Morse words is also briefly studied. 
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1 Introduction 

This work concerns a new class of words named privileged words which have previously been 
researched only a little. The motivation for defining these words comes from the research of so- 
called rich words [Gle+09] which are words having maximum number of distinct palindromes 
(thus the name, rich words are rich in palindromes). An important property of rich words is 
that a word is rich if and only if every complete first return to a palindrome is a palindrome. 
It's equivalent to say "every palindrome is a complete first return to a shorter palindrome". By 
a slight alteration of this condition we define privileged words: a word is privileged if it's a 
complete first return to a shorter privileged word. Moreover we need to define that the empty 
word and the letters of the alphabet are privileged. The effect of this modification is that every 
word is rich in privileged words, i.e. every word w has exactly \w\ + 1 distinct privileged factors 
whereas a rich word w has exactly \w\ + 1 distinct palindromes (there exist words which have 
strictly less palindromic factors). It turns out that privileged words and palindromes have some 
similar properties. This paper introduces the basic properties of privileged words, and questions 
regarding so-called privileged complexity of Sturmian words, episturmian words and the Thue- 
Morse word are studied. 

After introducing the notations and definitions, in Section 3 privileged words and their basic 
properties are presented. These basic results emphasize the analogue between palindromes and 
privileged words. Moreover privileged words are studied in relation to rich words. 
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Section 4 studies the number of distinct privileged factors in finite words. There's also discus- 
sion how privileged words fit into a recent work of G. Fici and Z. Liptak [FL12]. 

Various complexity functions of infinite words have been previously considered. In Section 5 
the notion of privileged complexity is defined. This section contains the main result of this pa- 
per: a characterization of Sturmian words using privileged complexity. As a by-product of the 
methods used in the proof of the main result, we obtain with little extra effort some previously 
known results, namely the fact that Sturmian words are rich, and partially a result of X. Droubay 
and G. Pirillo concerning the palindromic complexity of Sturmian words [DP99]. The section is 
concluded with a brief study of the privileged complexity of episturmian words. 

The last section studies briefly the privileged complexity of the Thue-Morse word. It's proven 
that the Thue-Morse word doesn't contain a privileged factor of odd length greater than three. 
However the even case is left open. Some numerical data and a conjecture are provided. 

2 Notation and Terminology 

In this text, we denote by A a finite alphabet, which is a finite non-empty set of symbols. The 
elements of A are called letters. A (finite) word over A is a sequence of letters. To the empty 
sequence corresponds the empty word, denoted by e. The set of all finite words over A is denoted 
by A*. The set of non-empty words over A is the setA + := A*\{e}. A natural operation of 
words is concatenation. Under this operation A* is a free monoid over A. The letters occurring 
in the word w form the alphabet of w denoted by fllph(w). From now on we assume that binary 
words are over the alphabet {0, 1}. For binary words we define the exchange operation: = 1 and 
1=0. Given a finite word w = a.\a.i ■ ■ ■ a n of n letters, we say that the length of w, denoted by 
\w\, is equal to n. By convention the length of the empty word is 0. We also denote by \w\ a the 
number of occurrences of the letter a in w . The set of all words of length n over the alphabet A is 
denoted A". 

An infinite word w over A is a function from the natural numbers to A. We consider such a 
function as a sequence indexed by the natural numbers with values in A. We write consicely 
w = flifl2 fl 3 " " " with a, 6 A. The set of infinite words is denoted by A". The infinite word w is 
said to be ultimately periodic if it can be written in the form w = uv w = uvvv ■ ■ ■ for some words 
u, v E A*, v 7^ e. If u = e, then w is said to be periodic. An infinite word which is not ultimately 
periodic is said to be aperiodic. 

A finite word u is a factor of the finite or infinite word w if it can be written that w = zuv for 
some z 6 A* and v E A* U A a '. If z = e, the factor u is called a prefix of w. If v = e, then we 
say that u is a suffix of w. The set of factors of w is denoted by The set J n (p) is defined to 

contain all factors of w of length n. A set of words X is factorial if every factor of w is a member 
of X for all w E X. If w = a\Hi ■ ■ ■ a n , then we denote w[i,j] = a, ■ ■ ■ ■ ai whenever the choices 
of positions i and j make sense. This notion is extended to infinite words in a natural way. An 
occurrence of u in w is such a position i, that w[i,i + \u\ — 1] = u. If such an index exists, we say 
that u occurs in w. If w has exactly one occurrence of u, then we say that u is unioccurrent in w. We 
say that a position i introduces a factor u if w[i — \u\ + l,i] = u, and u is unioccurrent in w[l, i]. 
A complete first return to the word u means a word starting and ending with u, and containing 
exactly two occurrences of u. A word which is a complete first return to some word is called a 
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complete return word. 

The reversal w of w = a\ai ■ ■ ■ a n is the word w = a n ■ ■ ■ aia\. If w = w, then we say that w 
is a palindrome. By convention the empty word is a palindrome. The set of palindromes of w is 
denoted by Tal{w). Moreover we define ( Pal n {w) — TaC(w) n fnip)- 

Let w = au where a G A and u G A*. We define the circular shift operation T as follows: 
T(zv ) = Wfl. By applying this shift operation repeatedly we obtain at most \w\ distinct words, 
called the conjugates of w. 

Let A and B be two alphabets. A morphism from A* to B* is a mapping / : A* — > B* such 
that f(uv) = f(u)f(v) for all words u,v G A*. Because of this morphic property the morphism 
/ is fully determined by its images on the letters. The morphism / is said to be non-erasing if for 
every a G A, f(a) G A + . A non-erasing morphism naturally extends to infinite words: for an 
infinite word w = fli«2 fl 3 ' ' ' / fi w ) — f { a \) f i a 2) f { a 3) ' ■ The morphism / is prolongable if there 
exists a letter a such that f{a) = aw for some w G A + . An infinite word w may be a fixed point 
of a morphism, i.e. f(w) = w. For a prolongable morphism / we have that f n (a) is a prefix of 
f n+1 (a) for alln > 0. Thus we obtain a unique fixed point f w (a) := lim n _5.oo /"(fl). 

Given an infinite word w over the alphabet A we say that a factor w of w is ng/if special (resp. 
left special) if wa and ub (resp. au and few) are both factors of w for some distinct letters a and b. A 
factor that is both right and left special is called bispecial. 

A set of binary words X is balanced if for all n > and every word u and v of X of length n 
it holds that ||u|i — < 1. A binary (finite or infinite) word is said to be balanced if its set of 
factors is balanced. 

3 Privileged Words 

Privileged words are a less known class of words which were recently introduced in [KLS11]. We 
define the set Tri^, the set of privileged words over A, recursively as follows: 

- e G Tri A , 

- a G Tri^ for every letter a in the alphabet, 

- if | ii> | > 2, then w G Tri^ if if is a complete first return to a shorter privileged word. 
When the alphabet is known from context, we omit the subscript A. Given a word w, we denote 

Tri(w) = {u G !F(w) : wis privileged}. 

The set !Pn„ (w) is defined to contain all privileged factors of w of length n. 
The first few binary privileged words are 

e, 0, 1, 00, 11, 000, 111, 010, 101. 

Not every privileged word needs to be a palindrome, for example the words 00101100 and 0120 
are privileged, but not palindromic. However privileged words and palindromes have some 
analogous properties, as we shall soon see. 

Lemma 3.1. Let wbe a privileged word, and u its any privileged prefix (respectively suffix). Then u is a 
suffix (respectively prefix) ofw. 
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Proof. If | if | < 1 or h = w, then the claim is clear. Suppose that \w\ > 2 and \u\ < \w\. By 
definition w is a complete first return to a shorter privileged word v. If | v \ < \u\, then by induction 
v is a suffix of u, and thus c would have at least three occurrences in w which is impossible. If 
u = v, then the claim is clear. Finally assume that \v\ > \u\, then by induction u is a suffix of 
v, and thus a suffix of w . The proof in the case that the roles of prefix and suffix are reversed is 
symmetric. □ 

The above Lemma is the first analogue to palindromes: a palindromic prefix of a palindrome 
occurs also as a suffix. 

Lemma 3.2. Let w be a privileged word, and u its longest proper privileged prefix (suffix). Then w is a 
complete first return to u. In other words the longest proper privileged prefix (suffix) of w is its longest 
proper privileged border. 

Proof. If \w\ < 1, then the claim is clear. Suppose that \w\ > 2, and that w is a complete first 
return to privileged word v. Now if \u\ > \v\, then v is a prefix of u, and thus by Lemma 3.1 also 
a suffix of u. Hence w has at least three occurrences of v, a contradiction. Therefore \u\ < \v\, 
and by the maximality of u, u = v, which proves the claim. The proof in the case that the roles of 
prefix and suffix are reversed is symmetric. □ 

Lemma 3.3. Let wbea privileged word, and suppose that it has border u. Then u is privileged. 

Proof. If \w\ < 1, the claim is clear. Suppose that \w\ > 2, and that if is a complete first return to 
privileged word v. Since v is the longest proper border of w, we may assume that \u\ < \v\. Now 
u is a prefix of v, and since u is a suffix of w and v is a suffix of w, also u is a suffix of v. Thus u is 
a border of v, and by induction, a privileged word. □ 

Palindromes share this property too: every border of a palindrome is a palindrome. 

The study of so-called rich words was initiated in [Gle+09]. Rich words are words having 
maximum number of distinct palindromic factors. In the following definition we count e as a 
palindromic factor. 

Definition 3.4. A word w is rich if it has exactly \w\ +1 distinct palindromic factors. An infinite 
word is rich if its every factor is rich. 

Next we state a useful characterization of rich words proven in [Gle+09]. 

Theorem 3.5. For any finite or infinite word w, the following properties are equivalent: 

(i) w is rich, 

(ii) every factor ofw which is a complete first return to a palindrome is itself a palindrome. □ 
The fact that the condition in the next Proposition is necessary was proved in [KLS11]. 

Proposition 3.6. Let wbea word. Then w is rich if and only ifPri(w) = ?Pa[(w). 

Proof. (=>) Suppose that the word w is rich. The claim is clear for factors u of length m < 1. 
Assume first that u, \u\ > 1, is privileged. By definition u is a complete first return to a shorter 
privileged word v. By induction v is a palindrome, and hence u is a complete first return to a 
palindrome, and is by Theorem 3.5 itself a palindrome. 
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Suppose then that u is a palindrome. Let v be the longest proper palindromic prefix of u. Now 
u is a complete first return to V. Otherwise u would have a proper prefix which is a complete first 
return to u, and by Theorem 3.5 this prefix would be a longer proper palindromic prefix of u than 
v. By induction it follows that v is privileged, and thus u too is a privileged word. 

(<=) Suppose now that Tri{w) = TaC{w) . Now let q be a complete first return to a palindrome 
p in w. By assumption p is privileged, and thus q too is privileged. Again by assumption q is a 
palindrome, and the claim follows from Theorem 3.5. □ 

As noted in [KLS11], privileged words are a "maximal generalization" of palindromes in the 
sense that every word is rich in privileged words, as is seen in Corollary 4.3. 

4 Privileged Words and Complete Return Factors 

Privileged words are special kind of complete return words. In this section we will prove that 
every word w has \w\ + 1 distinct privileged factors. We will also state a characterization of those 
words whose all complete return factors are privileged. This characterization has already been 
done in [FL12], but it seems that the authors missed the concept of privileged words, so we will 
briefly show the connection between privileged words and their work. 

The authors of [FL12] called complete return factors closed factors, but here we stick with more 
conventional vocabulary. In this section we count the empty word and the letters of the alphabet 
(as complete returns to the empty word) as complete return factors. 

Lemma 4.1. Let w E A + . Then every position ofw introduces at least one complete return factor ofw. 

Proof. Consider the position i of the word w, and the longest complete return factor v ending in i. 
Factor v exists since letters are complete return factors. We prove that v is unioccurrent in w[l, i], 
which proves the claim. Now if v had been introduced earlier, say at position j, then the factor 
w\j — \v\ +1, i] would be a complete first return to v contradicting the maximality of v. □ 

Corollary 4.2. [FL12] Every word w has at least \w\ + 1 complete return factors. □ 

A word w might have more than |zt>| +1 complete return factors (and most words do). Con- 
sider for instance the word w = l^Ol^O. Every position in w except the last introduces exactly one 
new complete return factor, but the last position introduces k + 1 new complete return factors, 
yielding a total of 3fc + 2 = \w\ + \ \w\ — 1 complete return factors in w. 

Corollary 4.3. Every word w E A* has exactly \ w\+l distinct privileged factors, i.e. every word is rich 
in privileged words. 

Proof. If we replace "longest complete return factor" with "longest privileged factor" in the proof 
of Lemma 4.1, we obtain that every position of w introduces at least one new privileged factor. 
Now if some position i would introduce two privileged factors, say u and v, with ju < \v\, then 
by Lemma 3.1 u would also be a prefix of v, i.e. it wouldn't be unioccurrent in w This is a 
contradiction, and thus every position of w introduces exactly one new privileged factor. □ 

From the proof we obtain the following facts: 
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Corollary 4.4. Let wbea word. If some position in w introduces exactly one complete return factor, then 
this factor is privileged. The word w has exactly \w\+l complete return factors if and only if its every 
complete return factor is privileged. □ 

In the article [FL12] words having the minimum number of complete return factors were 
considered. The authors called such words C-poor words. Since the minimum number of complete 
return factors in a word w is \w | + 1, we have that a word is C-poor if and only if its every 
complete return factor is privileged. We are ready to state a characterization of C-poor words. 

Proposition 4.5. [FL12] Let wbea word. Then the following are equivalent: 

(i) zv is C-poor, 

(ii) every complete return factor ofw is privileged, 

(Hi) w doesn't contain as a factor a complete first return to xyfor distinct letters x and y. □ 

It's worth noting that by part (Hi) C-poor words avoid all squares except squares of letters. If 
the word w is binary, then it can be said more: 

Proposition 4.6. [FL12] Let wbea binary word. Then the following are equivalent: 

(i) w is C-poor, 

(ii) every complete return factor ofw is a palindrome, 

(Hi) w is a conjugate of a word in 0*1*. □ 

Now part ( ii) of the above Proposition actually says that for a binary C-poor word w, Tri(w) C 
<Pat{w). We obtain that Tri(w) = TaC(w), since \w\ + 1 = \<Pri{w)\ < \TaC{w)\ < \w\ + 1. Thus 
by Proposition 3.6 the word w must be rich (as was observed in [FL12]). If the alphabet is larger 
than two letters, then not every C-poor word is rich: for instance the word 0120 is C-poor, but not 
rich. 

5 Privileged Complexity, Sturmian Words and Episturmian Words 

In the study of infinite words many different so-called complexity functions have been consid- 
ered. It's clearly of interest to try to count the number of distinct privileged words of length n 
occurring in a finite or infinite word w, that is, to figure out the privileged complexity of words. 

Definition 5.1. Let w be a finite or infinite word. The privileged complexity function which counts 
the number of distinct privileged factors of length n in w is defined as 

An(w) = \PrL„(w)\ 

for all n > 0. 
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5.1 Sturmian Words 



In this section we prove some basic results about the privileged complexity function function, 
and a characterization of Sturmian words using privileged complexity. First we need to discuss 
some related complexity functions. 

The factor complexity function C n {w) of the word w counts the number of distinct factors of w 
of length n, i.e. Cnip) — \7nip)\- We state the following well-known Theorem (for a proof see 
Theorem 1.3.13 of [Lot02]). 

Theorem 5.2. An infinite word w is aperiodic if and only if Cn(w) > n + \ for all n > 0, i.e w is 
aperiodic if and only if it has at least one right special factor of each length. □ 

Sturmian words are characterized by the fact that they are the simplest infinite aperiodic words 
in terms of the complexity function. They are defined as follows: 

Definition 5.3. An infinite word w is Sturmian if C n {w) = n + 1 for all n > 0. 

For more information about Sturmian words see the Chapter 2 of [Lot02]. The next two Propo- 
sitions are well-known (for proofs see Propositions 2.1.2 and 2.1.3 of [Lot02]). 

Proposition 5.4. Let X be a factorial set of words. If X is balanced, then |Xn{0,l}"| < n + lforall 
n>0. □ 

Proposition 5.5. Let X be a factorial set of words. IfX is unbalanced, then there exists a unique minimal 
unbalanced pair of the form (0x0, lxl) in X where the word x is a palindrome. □ 

Sturmian words are also characterized as follows (see Theorem 2.1.5 of [Lot02]): 

Theorem 5.6. An infinite binary word is Sturmian if and only if it's aperiodic and balanced. □ 

Sturmian words have numerous other characterizations. The characterization of interest here 
is the characterization in terms of the palindromic complexity function, due to X. Droubay and 
G. Pirillo [DP99]. The palindromic complexity function (P n {w) is defined as T n iw) = \'Pal n {w)\, it 
counts the number of distinct palindromes of length n in w. 

Theorem 5.7. [DP99] An infinite word w is Sturmian if and only it has palindromic complexity 



1, if n is even 

2, if n is odd 



for all n > 0. □ 

We say that the word w has the property Tp a \(w) if the word w satisfies the palindromic 
complexity of the above Theorem. We shall prove that the condition of this Theorem is sufficient 
after we have established a (similar) proof of Theorem 5.8. 

It's natural to ask what is the privileged complexity of Sturmian words, and if the answer to 
this question characterizes Sturmian words. This indeed is the case, and it's the main result of 
this section. 
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Theorem 5.8. An infinite word w is Sturmian if and only if it has privileged complexity 

!1, if n is even 
2, if n is odd 

for all n > 0. 

The proof of this theorem is based on two Lemmas 5.10 and 5.22. To simplify notations, we 
say that the word w has the property JST(w) if the word iv satisfies the privileged complexity of 
the above Theorem. 

We need the following result. After we have proved Lemma 5.22, we will provide a proof of 
this result. For another proof see Corollary 4 of [DJP01]. 

Theorem 5.9. Sturmian words are rich. □ 
Lemma 5.10. Sturmian word w has the property JS'P{w). 

Proof. By Theorem 5.9 the Sturmian word w is rich. Next, Proposition 3.6 says that Tri(w) = 
<PaC{w), and hence by Theorem 5.7 the word w has the property JS&{w) ■ □ 

For the purpose of proving the converse of Lemma 5.10 we introduce the concepts of Q- 
property and Q-f actors of words. 

Definition 5.11. A Q-property of words is defined to satisfy the following conditions: 

- Q(e) and Q(a) hold for all letters a e A, 

- for every position in every word there exists a factor with property Q ending at said posi- 
tion, 

- every position in every word introduces at most one factor with property Q. 
Factors with property Q are called Q-f actors. 

Definition 5.12. The Q-complexity function of a word w is defined as 

9#{w) = \{ue!fn{w):Q{u)}\. 
For a finite word w we define tt®(w) = \{u e ?(w) : Q(u)}\. 

The following Lemma follows easily from the definition. 
Lemma 5.13. Every word has at most \w\ + 1 distinct Q-factors, i.e. 9{Q{w) < \w\ + 1. □ 

It's well-known (see Proposition 2 of [DJP01]) that every position in every word introduces 
at most one new palindrome. Hence "being a palindrome" is a Q-property. From the proof of 
Corollary 4.3 we see that every position in every word introduces exactly one new privileged 
factor, and thus "being a privileged word" is a Q-property. Third possible Q-property could be 
"being a power of a letter". 

Lemma 5.14. Let w G A + be a finite word with \Mph~(w) | > 2. Then 9$(w) = Qfor some 2 < n < 
\w\. 
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Proof. We may assume that w has exactly + 1 distinct Q-f actors, since otherwise clearly 
9${w) = for some 2 < n < \w\. Now if Oi^(w) > for all 1 < n < \w\, then since we 
have \w\ positions in w, every position needs to introduce a distinct Q-f actor of different length. 
This however is impossible since by assumption \Mph(w) | > 2 positions introduce a Q-factor of 
length one. □ 

Definition 5.15. If 9${w) = Ofor some integer n, then we say that n is a vanishing index ofw. 

When it's said that n is a vanishing index in the last y-block of y m in the next Lemma, we 
mean that n is a vanishing index of y m , and that (m — 1) |y| < n < m\y\. 

Lemma 5.16. Let w = y w be a periodic infinite word. Then either 9-(®{w) = Ofor infinitely many n or 
there exists such k that 9^{io) = 1 for all n > k. 

Proof. Let r = |y|. If there are infinitely many vanishing indices, then the claim is clear. Assume 
that there are only finitely many vanishing indices. Let m be such an integer that the last vanish- 
ing index n is in the last y-block of y m . If no such integer n exists, we set m = and n = — 1. Now 
concatenating y to y m introduces at most r new Q-factors of different length since every position 
can introduce at most one new Q-factor. Now there might be some vanishing indices in the last 
y-block of y m+1 . However when the next y is concatenated to y m+1 it must be that 9{Q (y m+2 ) > 
for alln + l<z<(m + l)r since if 9{Q(y m+1 ) = for some n + 1 < ; < (m + l)r, then since 
adding more y's to the end of y m+2 doesn't introduce any new factors of length j, it would be that 
9{9-(w) = contradicting the maximality of n. 

Now if there are s vanishing indices in the last y-block of y" ,+1 , then there are at least s vanish- 
ing indices in the last y-block of y m+2 . Otherwise concatenating y to y m+1 would have introduced 
more than r Q-factors of different length which is impossible. It could be that the number of van- 
ishing indices in the last y-block of y m+2 increases, but such a phenomenom can occur at most r 
times. Hence there exists an integer m' such that the last y-block of y m +d has s' vanishing indices 
for all d > 0. We claim that ^(w) = 1 for all i > m'r, which proves the claim. Clearly by the 
maximality of n ?{p(w) > 1 for all i > m'r. Now concatenating y to y m " for m" > m' must intro- 
duce r new Q-factors of different length, since otherwise the number of vanishing indices in the 
last y-block of y m +1 would increase. Now if 9{Q{w) > 2 for some m'r < j, then concatenating 
y to y m for some m" > m' would introduce at least two new Q-factors of the same length or 
it would introduce at least one Q-factor of some length already introduced. This is impossible, 
since such a concatenation would introduce less than r Q-factors of different length. □ 

Corollary 5.17. Let w be an ultimately periodic infinite word. Then either ^p(ro) = for infinitely 
many n or there exists such k that !}{®(w) = lfor all n >k. 

Proof. Let w = xy w '. Adding the prefix x to y a ' introduces only finitely many new Q-factors. 
Hence the claim follows from Lemma 5.16. □ 

As a consequence of the above Corollary and the discussion after Lemma 5.13 we have the 
following two Corollaries. 

Corollary 5.18. An infinite word w having the property JS'P{w) is aperiodic. □ 
Corollary 5.19. An infinite word w having the property T Pa i (w) is aperiodic. □ 



9 



However there exist aperiodic infinite words w having Tri n (w) = for infinitely many n. 
One example is the Thue-Morse word. See Proposition 6.3. 

The following Lemma is a technical result taken out of the proof of Lemma 5.22 for clarity. 

Lemma 5.20. Let w be a binary word and u its palindromic factor. Assume that zv starts with ua and 
ends with au for some letter a, and doesn't contain aua or aua as factors. Moreover assume that u has 
exactly two complete return words in w which are both palindromes, one starting with ua and the other 
starting with ua. Then w is a palindrome. 

Proof. If u = e, then it's straightforward to prove that w is a palindrome. Assume that u ^ e. If 
w is a complete first return to u, then by the assumptions, w must be a palindrome. If this is not 
the case, then u has as a prefix and as a suffix a complete return to u, say v and v 1 respectively. 
Since u has exactly one complete return word starting with a, which is a palindrome, it must be 
that v — v' . If these occurrences of v in w overlap or w = v 2 , then w is a palindrome. Thus we 
may assume that w = vw'v, with w' =^ £. Since au is a suffix of v, and ua a prefix of v, and since 
aua isn't a factor of w, it must be that w' starts and ends with letter 8., If w' — a, then w is a 
palindrome. Thus we may write that w 1 = aw" a. Moreover it can be written that v = au = ua., 
and w = auaw"aua. Consider the word z = uaw"au. This word z now satisfies the premises of 
the claim with letters exchanged. Thus by induction and using a symmetric argument we obtain 
that z is a palindrome, and thus w itself is a palindrome. □ 

We are now almost ready to prove the converse of Lemma 5.10. The proof is similar to the 
proof of Theorem 5.7 in [DP99]. To simplify notations, we define for a word w the following 
properties: 

- %ch n {w) ^=^> Pri n (w) = <PaC n {w), 

- Spc n (w) there exists a unique right special factor of length n — 1, 

- 'Bal n (w) •<=>■ all factors of w of length n are balanced, 

- < R&V n (w) for each factor u of length n, also u is a factor of w. 

Thus by definition an infinite word w is Sturmian if and only if Spe n (w) is satisfied for all n > 0. 
We need the following well-known result. It's easy to deduce using so-called Rauzy graphs, see 
[VuiOl] for more details. 

Lemma 5.21. Let w be an infinite word, and suppose that Spt m (w) holds for m < n. If u is the right 
special factor ofw of length m < n, then u has exactly two complete return words. □ 

Lemma 5.22. An infinite word w having the property JS^P(w) is Sturmian. 

Proof. First of all since 2L\ (w) = 2, fPrij (w) = {0, 1} and the word w is thus binary. Hence £ is the 
unique right special factor of length 0. Clearly %ch^ (w),Spe 1 (w), Halx (zv) and 'Rg-Vi (zv) hold. We 
will next assume that %cfi. n (w),Spe- {w), < Bcd n {w) and ( Rg.v n {w) hold for n > 1, and prove that w 
satisfies all these properties for n + 1. This proves the claim. 

Case 1. H^cH n+1 (zv) 

Let x G (Pri n+ i(zu). Now the word x is a complete first return to a privileged word u. By the 
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induction hypothesis u is a palindrome. Now if u overlaps with itself in x or x = u , also x must 
be a palindrome. Thus we may assume that x = uXu for some A ^ £. If \u\ = 1, then x = ab"a 
for some letters a and b, and x is a palindrome. Assume further that \u\ > 2. Now m = am'a 
for some letter a and palindrome u', so x = au'aXau'a. Denote z = u'aXau 1 . We prove that z 
is a palindrome. Now if z is a complete first return to u' , which is privileged by the induction 
hypothesis, then z is privileged, and a palindrome. Suppose that this is not the case. Now z has 
as a prefix and a suffix a complete first return to u', say v and v' respectively. Now v and v' can't 
overlap since otherwise z would contain au'a as a factor. For the same reason v must be followed 
by the letter a. Thus u' has at least two palindromic complete return words: one starting with 
u'a and another starting with u'a. However by the induction hypothesis Spe m (zv) holds for all 
m < n, and by applying Lemma 5.21, we have that u' has exactly two complete return words. 
Moreover by the induction hypothesis t Bal m iw) holds for all m < n, and thus z can't have au'a as 
a factor. Now by applying Lemma 5.20 we obtain that z is a palindrome. Now in any case z is a 
palindrome, and thus also x is a palindrome. Hence !Pri n+ i C Tal n+ \(w) . 

Let then x £ (Pa[ n +\(w) and u its longest proper border. Now u must be a palindrome, and 
hence by the induction hypothesis, a privileged word. The word x must be a complete first 
return to u, since otherwise there would be a privileged proper prefix v longer than u. That 
would be a contradiction with the maximality of u, since v would also be a palindrome by the 
induction hypothesis. Therefore x is privileged, and Tat n+ \(zv) C Tri n+ i(zv). 

Case 2. < BaL n+ \(w) 

Assume on the contrary that l Bal n+ i (w) doesn't hold. Then by Proposition 5.5 there exists a palin- 
dromic factor x such that 0x0, lxl £ !F n +i{ w )- By Case 1 we know that Tri n+ i{w) = !PaC n+ i(w), 
and since palindromes 0x0,1x1 £ ^Pa[ n+ i(iu) , by assumption, n + 1 is odd, and hence also 
w — 1 is odd. Again by assumption ¥a[ n -i(zv) = {x,t} where x ^ t. Moreover x is a bispecial 
factor of length n — 1. Since Spe n _ 1 (zv) holds, we conclude that t isn't right special. Now 
ta G 7nip) for some letter a. By the property %ev n {w), also at £ ^(w). Since f isn't right 
special, a to £ ^F„ + i(tf). Thus {0x0, lxl, afa} C 5 , a4 + i(zy) = (Pri„ + i (zv), contradicting the fact 
that Stn + i(w) = 2. 

Case 3. Spe n+1 {zv) 

Assume on the contrary that Spe n+1 {zv) doesn't hold. Then zv must have at least two right special 
factors of length n or zero right special factors of length n. In the latter case zv is ultimately 
periodic by Theorem 5.2. This is however a contradiction with Corollary 5.18. Thus w has at 
least two right special factors of length n, say u and v. Let z be the longest common suffix of u 
and v, i.e. u = u'az and v = v'az for some letter a. Since u and v are right special, aza, aza £ !F(zv) 
contradicting l BaL m (iv) for some m < n + 1. 

Case 4. 

Denote f n {w) = {w : w £ 5 n (a?)}. Consider the set X = U"^ u ?i{ w )- This set is bal " 

anced since otherwise by Proposition 5.5 there would exists palindromes 0x0, lxl £ X, and hence 
(since these words are palindromes) 0x0, lxl £ ^F ffl (zv) for some m < n + 1. This is a contradiction 
with the induction hypothesis or the Case 2. Thus by Proposition 5.4 |X n {0, 1}" +1 1 < n + 2. On 
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the other hand by Theorem 5.2 |X n {0, 1}" +1 1 > C„ +1 (w) = n + 2. Thus |X n {0, 1}" +1 = n + 2, 
and it must therefore be that Tn+l { w ) = 7n+l { w ) which means that %g.V n ^\ (w) is satisfied. □ 

Lemmas 5.10 and 5.22 establish Theorem 5.8. The proof of Lemma 5.22 with minor modifi- 
cations proves too that an infinite word W having the property Tp a i{w) is Sturmian: the Case 1 
is omitted, and the Case 2 needs to be slightly adjusted. Otherwise the proof can be kept intact, 
Corollary 5.19 ensures that an infinite word having property Tp a i(w) must be aperiodic. 

The proof of Lemma 5.22 gives us also a proof of Theorem 5.9. 

Proof of Theorem 5.9. Now a Sturmian word w has the properties l Bal n {w) and Spe {w) for all 
n > 0. As in the Case 1 of the proof of Lemma 5.22, we obtain that w has the property %ch n (w) 
for all n > 0. In other words, Tri{w) = Tat{w), and from Proposition 3.6 it follows that w is 
rich. □ 

Note that not every Q-complexity function characterizes Sturmian words. Take Q to be the 
property "being a power of a letter". Now for a Sturmian word w either 00 £ !F(zv) or 11 £ !F(w) . 
By symmetry assume that 11 i J-{w). It's also well-known that there exists a k such that 0^ £ 
J-(w), but k+1 i !F(w). Hence the Q-complexity of w would be 

{1, if n = or 2 < n < k 
2, ifn = l 
0, otherwise. 

However for instance the non-Sturmian word (0 l) w has the same Q-complexity function. 
5.2 Episturmian Words 

We conclude Section 5 by considering briefly the privileged complexity of so-called episturmian 
words, which are a generalization of Sturmian words to arbitrary alphabet. Episturmian words 
have a rich theory, for more about these intriguing words, see the foundational paper [DJP01] 
and the survey [GJ09]. 

Definition 5.23. An infinite word w over alphabet A (\A\ > 2) is episturmian if w has the property 
%tv n (w) and has at most one right special factor of length n for all n > 0. 

Definition 5.24. Episturmian word w is A-strict if for each n > there exists a unique right 
special factor u of length n and ua £ ^F(w) for all a £ A. 

Actually A-strict episturmian words over the alphabet A are exactly the so-called Arnoux- 
Rauzy words over A. Note that A-strict episturmian words are aperiodic, and if A = {0, 1}, then 
these words are exactly the Sturmian words. 

In the Corollary 2 of [DJP01] it was proved that episturmian words are rich. Therefore we 
may proceed as we did with Sturmian words: the palindromic complexity of episturmian words 
gives us their privileged complexity by Proposition 3.6. 

The following is Theorem 4.4 in [JP02]. 
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Theorem 5.25. [JP02] Let wbea A-strict episturmian word over the alphabet A. Then w has palindromic 
complexity 



1, if n is even 
\A\, if n is odd 



for all n>0. □ 

Thus we have the following result: 

Theorem 5.26. Let zv be a A-strict episturmian word over the alphabet A. Then w has privileged com- 
plexity 



1, ifn is even 
\A\, ifn is odd 



for all n>0. □ 

However the privileged complexity of A-strict episturmian words doesn't characterize them 
when | A | > 2. Words coding r-interval exchange transformations are a class of rich words which 
satisfy the palindromic complexity of Theorem 5.25 [BMP07]. Being rich they also have the same 
privileged complexity as episturmian words. However words coding r-interval exchange trans- 
formations are not episturmian when r > 2 (here r is the number of letters). One example of such 
a word is the fixed point of the following morphism 

a h-> c, 
a : b i—)- ca, 
c i — ^ caba. 

The fixed point isn't episturmian since both letters a and c are right special, but it satisfies the 
complexities of Theorems 5.25 and 5.26. 

Actually not even both factor complexity and privileged complexity of A-strict episturmian 
words characterizes them since words coding r-interval exchange transformations have the same 
factor complexity as episturmian words [BMP07]. 

6 Privileged Complexity and the Thue-Morse Word 

In this section we investigate briefly the privileged complexity of the Thue-Morse word. The 
infinite Thue-Morse word t is defined as the fixed point of the morphism q>: 

i-> 01, 
r 1 ^ 10. 

For more information about the Thue-Morse word, see Chapter 2 of [Lot83]. The word t has the 
following well-known property: 

Theorem 6.1. The Thue-Morse word t is overlap free. □ 
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Lemma 6.2. Let wbea non-empty even length privileged factor oft. Then 00 or 11 is a factor ofw. 

Proof. Using the fact that t is overlap free, it can be easily shown that no factor of t of length 
greater than four avoids factors 00 and 11. The only possible privileged factors of length two are 
00 and 11. For privileged factors of length four, the possibilities are 0000, 1111, 0110 and 1001. 
Words 0000 and 1111 are not factors of t, but anyway the claim is proved. □ 

Proposition 6.3. The infinite Thue-Morse word t doesn't have any privileged factors of length n, when n 
is odd and n > 5. 

Proof. Let w be an privileged factor of t of odd length, which is a complete first return to a privi- 
leged word u. Denote x = 01 and y = 10. 

Assume first that \u\ is odd. Since 000 and 111 are not factors of t, it must be that \u\ > 1. 
Moreover \w\ > 5, since if \w\ =5, then the occurrences of u in w would need to overlap, and 
t is overlap free. We need to only prove that u can't be 010 or 101 (the privileged factors of t of 
length three). We prove that u can't be 010, the other case is symmetric. Assume first that the 
factorization of w over {x, y} matches from the beginning of w. So we are looking for a factor of 
t starting and ending with xx, and containing no internal occurrences of xx = 0101 or yy — 1010. 
Using the fact that t is overlap free, one can by inspection deduce that the only possibility is 
xxyxx. However xxyxx is not a factor of t. Assume then that the factorization over {x, y} doesn't 
match from the beginning. Now if u = 010, then u must be preceded by 1 in t. Thus we would 
have found a complete first return to 101, say v, of length \w\, and the factorization of v over 
{x, y} would match from the beginning. Earlier it was proved that such a factor v can't exist. 

Assume then that \u\ is even. By Lemma 6.2 u contains 00 or 11 as a factor, say it contains 00. 
Suppose that 00 occurs at an even position in the prefix v of w. Then since |a?| is odd, it must 
be that 00 occurs at an odd position in the suffix v. Therefore w doesn't match any factorization 
over {x, y}. If 00 occurs at an odd position in the prefix v, one arrives at a contradiction using a 
symmetric argument. □ 

Now it's also true that the Thue-Morse word doesn't contain any odd palindrome of length 
greater than three (for a proof see [BBL08]). However the number of even length privileged 
factors can't be calculated in the same way as we did earlier with Sturmian words, since the 
Thue-Morse word isn't rich. For instance the following factor of t is not rich: 11010011. 

The case of even length privileged factors is more complicated. So far it's not known to the 
author how to evaluate the number of even length privileged factors in t. In the next table there 
are some values for J^j if) for even n. The results are based on a computer search. 



2-10 


12-20 


22-30 


32-40 


42-50 


52-60 


62-70 
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14 
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14 
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4 





8 


6 


2 





2 


8 


2 
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2 
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8 
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There are interesting gaps of zeros in J3n(f). For instance %n(t) =0 for 81 < n < 85, 113 < 
n < 117, 145 < n < 149, 177 <n < 181 and 189 < n < 257. Based on the computer searches and 
an educated guess, we state the following conjecture: 

Conjecture. There exist arbitrarily long (but not infinite) gaps of zeroes in the values of &„{t). 
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